Discovering Topical Aspects in Microblogs
نویسندگان
چکیده
We address the problem of discovering topical phrases or “aspects” from microblogging sites like Twitter, that correspond to key talking points or buzz around a particular topic or entity of interest. Inferring such topical aspects enables various applications such as trend detection and opinion mining for business analytics. However, mining high-volume microblog streams for aspects poses unique challenges due to the inherent noise, redundancy and ambiguity in users’ social posts. We address these challenges by using a probabilistic model that incorporates various global and local indicators such as “uniqueness”, “diversity” and “burstiness” of phrases, to infer relevant aspects. Our model is learned using an EM algorithm that uses automatically generated noisy labels, without requiring manual effort or domain knowledge. We present results on three months of Twitter data across different types of entities to validate our approach.
منابع مشابه
Discovering and Tracking Events From News, Blogs and Microblogs on the Web
Using three data sources, news, blogs, and microblogs, this study proposes a framework for discovering and tracking events embedded in free form online text. Existing methods for text mining are discussed for the three sources. Because three sources have different perspective, event analysis, region-topic model and rare keywords are proposed respectively. In order to integrate three data source...
متن کاملMicroblogs Data Management Systems: Querying, Analysis, and Visualization (Tutorial)
Microblogs data, e.g., tweets, reviews, news comments, and social media comments, has gained considerable attention in recent years due to its popularity and rich contents. Nowadays, microblogs applications span a wide spectrum of interests, including analyzing events and users activities and critical applications like discovering health issues and rescue services. Consequently, major research ...
متن کاملEvaluating Ranking Diversity and Summarization in Microblogs using Hashtags
Diversification techniques for web search have recently been developed that assume that, for each query, there is a set of underlying aspects or subtopics that address specific user intents. These techniques attempt to balance the relevance of the retrieved documents with the coverage of the aspects. Evaluation of diversification techniques requires some way of defining a set of aspects for eac...
متن کاملSerendipitous learning: Recognizing and fostering the potential of microblogging
This paper introduces the concept of serendipitous learning in the context of microblogging and discusses the potential of unplanned and unexpected discoveries for learning. Serendipitous learning as a subset of incidental learning refers to learning through gaining new insights, discovering unrevealed aspects and recognizing seemingly unrelated connections. This type of learning can occur by c...
متن کاملSocial Links from Latent Topics in Microblogs
Language use is overlaid on a network of social connections, which exerts an influence on both the topics of discussion and the ways that these topics can be expressed (Halliday, 1978). In the past, efforts to understand this relationship were stymied by a lack of data, but social media offers exciting new opportunities. By combining large linguistic corpora with explicit representations of soc...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2014